IMPROVED QUERY EXECUTION IN QUERY PROCESSING SYSTEMS 



FIELD OF THE INVENTION 

The present invention is directed to an improvement in computing systems and in 
particular to improved query execution in query processing systems, such as relational data 
bases. 

BACKGROUND OF THE INVENTION 

In query processing systems, such as relational data base management systems 
(RDBMS), data values are extracted from stored representation of the data for further 
processing by the query evaluation system. Typically, and in RDBMS systems, the data is 
structured as rows comprised of column values. The rows are grouped into contiguous 
storage blocks known as pages. A part of query evaluation in such systems is the isolation 
of successive rows and extraction of a subset of the column values for the row. These 
values are used in query evaluation steps which may include one or more of filtering, 
sorting, grouping, joining, or other relational or data manipulation steps. Copying the data 
values from data pages involves the step of identifying and locating in main memory the 
page containing the row of interest, locating the row within the page and locating the 
column values within the row. The column values are then copied to new locations in 
memory where they are made available for query evaluation. Typically a page may be 
located in the main memory of a computer or located on a secondary storage device, 
typically a computer disk. In query evaluation systems which support concurrent query 
executions, the page containing data must be "stabilized" to ensure that it remains at the 
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same location in memory and to prevent concurrent updates to the page to preserve the 
logical integrity of the page contents, while it is being accessed by a particular process or 
thread. After copying column data values to a new location, the page stabilization is ended 
(released). The steps of locating the page, stabilizing the page, locating a row in the page, 
and releasing the stabilization for each row to be processed by the query evaluation system, 
may constitute a significant portion of the overall execution cost of a query. 

In the prior art, techniques are known to improve the efficiency of query execution 
by reducing the repeated location and stabilization of pages containing data. Efficiencies 
may be realized where a page is located and then remains stabilized for successive read 
operations. These approaches include search argument processing in which an evaluation of 
a retrieved row relative to a query predicate is performed prior to the release of the 
stabilization of the page containing the row. Where the row being retrieved does not meet 
the query predicate condition, the next row is retrieved from the page. This method permits 
multiple rows to be read from the page without releasing and reacquiring the stabilization on 
the page. However, where rows are located in the page which do satisfy the query predicate, 
the data values are copied for processing by the query processor and the stabilization on the 
page is released. Thus where the query predicate is satisfied, there is no increased efficiency 
resulting from the adoption of the above approach. 

It is therefore desirable to have a query execution technique which provides 
increased efficiency of the query execution where query predicates are satisfied by retrieved 
rows. 
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SUMMARY OF THE INVENTION 

According to one aspect of the present invention, there is provided a query execution 
in a query processing system having improved efficiency. 

According to another aspect of the present invention, there is provided a method for 
processing a database query in a database management system comprising a data manager, a 
set of data, a query processor, and a buffer, the method comprising the steps of: 

the query processor calling the data manager to request the return of data from the set 

of data, 

the data manager accessing the set of data to locate query-specified data and 
determining if the query-specified data is to be ignored, consumed, or returned to the query 
processor, 

where the data manager determines that the query-specified data is to be returned to 
the query processor, the data manager writing the query-specified data to the buffer, the data 
manager repeating accesi to the set of data until all data requested by the query processor 
has been accessed, and 

the query processor retrieving query-specified data jfrom the buffer. 

According to another aspect of the present invention, there is provided the above 
method, in which the set of data is stored on pages and the method further comprises the 
step of the data manager stabilizing the page on which the query-specified data is located 
prior to accessmg said data, the method fiirther comprising the step of maintaining the 
stabilization of the page while writing to the buffer and until all data requested by the query 
processor on the page has been accessed. 

According to another aspect of tiie present invention, there is provided a program 
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storage device readable by a machine, tangibly embodying a program of instructions 
executable by the machine to perform the above method steps. 

According to another aspect of the present invention, there is provided a computer 
program product for a database management system comprising a data manager, a query 
processor and a buffer for processing a database query a set of data, the computer program 
product comprising a computer usable medium having computer readable code means 
embodied in said medium, comprising: 

computer readable program code means for the query processor to call the data 
manager to request the return of data from the set of data, 

computer readable program code means for the data manager to access the set of data 
to locate query-specified data and to determine if the query-specified data is to be ignored, 
consumed, or returned to the query processor, 

computer readable program code means for, where the data manager determines that 
the query-specified data ik to be returned to the query processor, the data manager to write 
the query-specified data to the buffer, and for the data manager to repeat access to the set of 
data until all data requested by the query processor has been accessed, and 
computer readable program code means for the query processor to retrieve query-specified 
data from the buffer. 

According to another aspect of the present invention, there is provided the above 
computer program product, in which the set of data is stored on pages and in which the 
computer usable medium having computer readable code means embodied in said medium, 
further comprises computer readable program code means for the data manager to stabiUze 
the page on which the query-specified data is located prior to accessing said data, and for the 
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data manager to maintain the stabilization of the page while writing to the buffer and until 
all data requested by the query processor on the page has been accessed 

According to another aspect of the present invention, there is provided a query 
processing system comprising a data manager, a query processor and a buffer for processing 
a database query on a set of data, 

the query processor comprising means for calling the data manager to request the 
return of data from the set of data, 

the data manager comprising means for accessing the set of data to locate query- 
specified data and for determining if the query-specified data is to be ignored, consumed, or 
returned to the query processor, 

the data manager comprising means for, where determining that the query-specified 
data is to be returned to the query processor, writing the query-specified data to the buffer, 
the data manager repeating access to the set of data until all data requested by the query 
processor has been accessed, and 

the query processor comprising means for retrieving query-specified data from the 

buffer. 

According to another aspect of the invention there is provided the above query 
processing system, in which the set of data is stored on pages and the data manager fiirther 
comprises means for stabilizing the page on which the query-specified data is located prior 
to accessing said data, the data manager fiirther comprising means for maintaining the 
stabilization of the page while writing to the buffer and until all data requested by the query 
processor on the page has been accessed. 

Advantages of the present invention include a query execution technique requiring 
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reduced page stabilization and improved processor utilization. 

BRIEF DESCRIPTION OF THE DRAWING 

The preferred embodiment of the invention is shown in the drawing, wherein: 
Figure 1 is a flow chart illustrating the steps in query interpretation using the 

preferred embodiment of the invention. 

In the drawing, the preferred embodiment of the invention is illustrated by way of 

example. It is to be expressly understood that the description and drawing are only for the 

purpose of illustration and as an aid to understanding, and are not intended as a definition of 

the limits of the invention. 

DETAILED DESCRIPTION 

Figure 1 is a flow chart diagram illustrating steps in interpreting a query in 
accordance with the preferred embodiment of the invention. Start 1 0 represents queries, 
such as SQL statements, that are processed to access data in a database. In the example 
architecture of the preferred embodiment, queries are first processed by compiler 12. 
Compiler 12 provides, query processor 14 with an access plan based on the nature of the 
SQL query being compiled. As required, query processor 14 calls data manager (data 
management system or DMS) 1 6 to obtain access to data table 1 8 (the step of accessing data 
table 1 8 is sometimes termed a data scan). In the preferred embodiment records or rows of 
data are stored on pages in data table 1 8. In the preferred embodiment, rows of data table 1 8 
are available for copying to buffer 20. 

In query processing systems that support concurrent access to data, the location and 
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stabilization of a page containing data is a potentially expensive operation. Each time that 
data management system 16 stabilizes a page in data 18, and locates (using a notional cursor 
in the preferred embodiment) a row or record position in the page in data 18^ there will be a 
resulting time cost introduced in the processing of the query. The prior art includes 
techniques to allow multiple rows to be read from the same page in data 18 without 
releasing the stabilization or losing the location in the page. These techniques are typically 
applied, however, where column data for a row in the table of data 1 8 is not returned to 
query processor 14. Therefore, where a series of rows are read from the table of data 18 and 
column values for each row are, in fact, returned to query processor 14, there is a repeated 
time cost involved in data management system 16 repeatedly stabilizing the page and 
locating the relevant rows in the page. 

To improve query execution efficiency where row data is returned to query processor 
14, the preferred embodiment has buffer 20 that is available for use where rows are read 
from data 18 and are to tie returned to query processor 14. According to the preferred 
embodiment, buffer 20 will be used where such techniques as search argument processing 
do not prevent row data being returned to query processor 14. Whereas in the prior art, the 
stabilization of the page containing the row in data 1 8 is released when a row is returned, in 
the preferred embodiment the retrieved column data for the row is copied from data table 1 8 
to buffer 20, rather than being returned directly to query processor 14. The page 
stabilization is not automatically released. 

As will be apparent, use of buffer 20 incorporates the additional steps of copying 
data values in the query execution process. In certain circumstances, however, despite the 
added cost associated with the copying of data values to the buffer efficiencies are realized 
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by a reduction in page stabilization steps required. In addition, the code which is executed 
to implement the steps of copying data values to a buffer is anticipated to be significantly 
smaller than the code required to copy data values for use by query processor 14. As a 
result, instruction and data references are localized which results in improved processor 
utilization and therefore additional efficiencies in the query execution, itself For these 
reasons the use of buffer 20 potentially improves performance for a query execution. 

Use of buffer 20 may not be advantageous in all circumstances. Factors which 
influence the use of such a technique include the extent of resource usage and time cost 
inherent in the copying steps and the amount of data estimated to be returned from the data 
scanning access method. Where data found on each page is limited, the advantages of buffer 
20 are reduced. 

In the preferred embodiment, query processor 14 is designed to ensure that data is 
retrieved from buffer 20 in an appropriate manner. As will be apparent, data may be copied 
to buffer 20 in response io a query processor 14 called to data management system 16 in a 
manner which results in buffer 20 being partially filled, or in buffer 20 being completely 
filled. In the implementation of the preferred embodiment, where buffer 20 is completely 
filled, it is also possible for an "extra" set of data values (an extra row from the table) to be 
retrieved by data manager 16. 

Query processor 14 is implemented to process both the "extra" row and the rows in 
buffer 20. Where the access plan created by compiler 12 requires that the processing of the 
data maintain the sequential order of the rows, the "extra" row must be processed after the 
rows in buffer 20. To accommodate this, the preferred embodiment saves the "extra" row to 
a single row buffer (not shown) to permit these values to be copied to the locations in 
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memory for subsequent query evaluation operations. 

Where the data values returned by way of buffer 20 are of a size to precisely fit 
buffer 20, there v^ill be no "extra" row returned by the DMS. A flag must be set to permit 
the query processor 14 to recognize that rows exist in buffer 20 and to process these rows. 

The following pseudo code description sets out an implementation of the copying of 
a row from page 1 8 to buffer 20. 

DataCopySarg(RelevantCols/*refs to col vals of current row*/) 
if (BufferEmpty - TRUE) 
BufferUsed=0; 
BufferConsumed=0; 
BufferEmpty=FALSE; 
RowSize=sum(sizeof(relevantCols)); 
if(RowSize<=BufferSize-BufferUsed) 

Copy from relevantCols to Buffer[BufferUsed]; 
BufferUsed=BufferUsed + RowSize; 
retum( CONSUMED ); 
else 

retum( DONE ) //buffer full. Access method must copy 
//column values & return to query eval 

QueryEvalRowScannerO 
if( BufferEmpty =FALSE) 

if(BufferUsed>BufferConsumed) 
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Copy from Buffer(BufferConsumed) to HomeBuffers; 
RowSize=sum(sizeof(copied columns)); 
BufferConsumed = BufferConsumed + RowSize; 
else 

Copy from ExtraRow to HomeBuffers; 

BufferEmpty=TRUE; 

retum(ROW_AVAILABLE); 
else if(SeeEOF=TRUE) 

return(EOF); 
else // not non-empty buffer or EOF 

AccMethResult=accessMethod(getNext);if(AccMethResult=EOF)if(BufferEmpty 
= = FALSE) 
SeenEOF=TRUE; 

f 

Copy from Buffer(BufferConsiimed) to HomeBuffers; 
RowSi2e=sum(sizedof(copied columns)); 

BufferConsumed=BufferConsumed+RowSize; retum( ROW_AVAILABLE); 

else 
retum( EOF); 

else//not EOF from access method 
Copy from HomeBuffers to ExtraRow; 

The following table sets out the characteristics of the data objects referred to in the 
above pseudo-code description of the implementation: 
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NAME 



DESCRIPTION 



Buffer 



ExtraRow 

BufferSize 
BufferUsed 

BufferConsumed 

BufferEmpty 
SeenEOF 

HomeBuffers 
RelevantCols 



The multi-row buffer. The size of the multi-row buffer 
is optimised based on the number of rows expected, 
the sizes of the column data values, and the cost of 
dedicating memory to the multi-row buffer. 

Buffer for the "extra" row returned by that data scanning 
access method 

The size, in bytes, of the multi-row buffer 

The amount, in bytes, of the multi-row buffer currently 
containing data column values from zero or more rows 

The amoimtj in bytes, consumer from the multi-row 
buffer. Should be less than or equal to BufferUsed 

A flag indicating that the multi-row buffer is empty 

A flag indicating that the scanning access method 
has reported that there are no more rows 

The locations in memory designated to receive the 
relevant data column values of a scanning access 
method 

The locations in the current page of the scanning 
access method of the columns needed by the copying 
S ARG operator 



The foUov^ng table sets out details of certain procedures referred to in the pseudo- 
code example set out above: 



Create home buffers 



The home locations for the relevant column values of 
rows identified by a data scarming access method must 
be determined and communicated to both the data 
scaiming access method and the query processor. 
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Prepare HomeBuffers list 



The list of home birffers must be prepared for use by 
the copying SARG operation mechanism. 



Create Buffer and buffer 



The multi-row buffer must be created and the 



control information 



BufferEmpty and SeenEOF flags must be set to true 



and false respectively. 



Prepare SARG processor The information used to control the SARG processor 



As the above example implementation indicates, the code for use of buffer 20 is 
potentially limited and may run in a tight loop with both code and data kept in cache close to 
a computer processor. Fdr this reason, the use of buffer 20 for certain queries will improve 
performance of the processing of the query. 

Although a preferred embodiment of the present invention has been described here in 
detail, it will be appreciated by those skilled in the art, that variations may be made thereto. 
Such variations may be made without departing from the spirit of the invention or the scope 
of the appended claims. 



directives 



must be prepared to include the indication that the 



copying SARG operator is to be invoked if all the 



predicates are TRUE, 
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